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1 . SCOPE 


This specification establishes the preliminary design for 
two axis color coded spectral plots to be output on a multi- 
file Universal formatted tape. 

The location of each pixel on the two axis scatter plot will 
be computed using the radiance values or two linear combina- 
tions of radiance values. The color of the pixel on the scatter 
plot may optionally be set in one of the following ways: 

1. The original radiance value of the pixel. 

2. The mean value of the cluster/subclass that the pixel 
was assigned to during clustering/classification. 

3. The mean value of the field the pixel was extracted from 
for training/tes t fields. 

4. User defined colors, 

5. From a different passwhen using multi-registered 
Landsat data. 

Offset factors and scale factors may be used for centering 
the data on the scatter plot. The background may be set to 
black or may default to white. 

The channels will be output on the tape as follows: 

1. Channels 1 thru N-l will be the color of the pixel. 

2. Channel N will be the number of occurrences of the pixel. 


A color key will be provided when the color is assigned to a 
pixel by the field mean, cluster mean, or classification re- 


suits. 


The secondary function is to retrieve information relating to 
the color codes from the cluster/subclass/field number, image 
data tape, or mean statistics for the test/training fields. 

If cluster/subclass number is requested, a cluster/classifi- 
cation map tape must be input. The maptape is read into core 
_and stored on high speed drum. When a new data vector is en- 
countered during the histogramming processor, the cluster/ 
subclass number for that pixel is retrieved from the drum. 


If the field number is requested, the program supplies the 
field number. If the mean statistics for the fields are 
requested, the program computes the means of each input field 
and supplies the field number. 

If the color codes are to be extracted from the imagery data 
tape, the user may select a maximum of four channels. From 
1 to 4 radiance values will be packed into one computer word. 

For the class level grouping, a file is written each time all 
the fields within a class are processed; for the subclass 
level grouping, each time all the fields for one subclass are 
processed; and for the per field level, each time a field is 
processed . 

A line printer pixel frequency scatter plot may be selected. 
The location of the pixel is determined by its radiance values 
the frequency of occurrence or log of frequency of occurrence 
will be represented by increasing grey levels. A key relating 


the grey levels to the number of occurrences of a pixel will 
be printed. 

The above specifications will be implemented into EOD-LARSYS 
by authorization of Action Document 63-0997-1695-09 in support 
of Research Test and Evaluation in Building 17/JSC. 

This document assumes the reader is familiar with the EOD- 
LARSYS system and the terminology used in the pattern recog- 
nition system. 


2 . APPLICABLE DOCUMENTS 


The following documents, of exact issue shown, form a part of 
this specification to the extent specified herein: 

• User Documentation EOD-LARSYS , Lockheed Electronics Company, 
Inc., HASD, Houston, Texas, November 1975 

• Action Document 63-0997-1695-09 


3. SYSTEM DESCRIPTION 


3.1 HARDWARE DESCRIPTION 

N/A 

3 . 2 SO FTWARE DESCRIPTION 

The batch oriented, pattern recognition system EOD-LARSYS is 
operational on the UNIVAC 1108, Executive II version. At 
present, the system is composed of nine processors. A pro- 
cessor is defined as an executable program that performs one 
main function. 

To implement the requirements outlined in section 1, two 
new processors will be added to the existing system. One 
processor, NDHIST, will perform an N-dimensional histogram 
of the imagery data (n<.16) and output a file to be used as an 
interface to the processor, SCTRPL. The other processor, 
SCTRPL, will read the N-dimensional histogram file and output 
two axis color coded spectral plots on a multi -file Universal 
formatted tape. 


3.2.1 SOFTWARE COMPONENT NO. 1 (NDHIST) 


The NDHIST processor will perform an N-dimensional histogram 
of data vectors extracted from the MSS data tape. There are 
two sets of channels, S^ and S 2 , used in the computations of 
histogramming the data. The procedure is as follows: 


1 . 


Given S 1 ={n 1 ,n 2 , . . .n^.} and S 2 ={ }, the frequency of 
occurrence for each unique data vector is computed for 


channels n, , n. 


• n i' 


Given S r = {n., ,n 7 , 


i.^m‘ 2 * • • ,n i^ an£ * S 2 = t m i* m 2 » • • * m j ^ * t * ie number 
of unique data vectors accumulated is determined by 


channels n^,n 2 


, . . .n^, 


and the frequency of occurrence 


associated with each unique data vector, is computed 
using channels m^, m 2 ...,mj. Given and y^ are dat 
vectors determined by sets S^ and S 2 respectively, the 
frequency is calculated in one of three ways: 


a. 


If for each x^, there is only one y^, the frequency 
is calculated on each occurrence of y^. 

If for a given x^, there is more than one y^, the 
frequency is calculated on the first y^ encountered, 
The other y^’s are ignored. 

Example : 

Given 

x 1 = {24, 23, 22, 10} 
and 

x 1 = {24, 23, 22, 10} 

the frequency count associated with x^ is 1; the 
vector y 2 is thrown out. 

If for a given y^, there are more than one x^, the 
frequency for y^ is calculated based on its associa- 
tion with a particular x^. 


{41, 42, 43, 20} 


y 2 = {41, 40, 42, 19}, 
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Example: 

Given 

x x - {24, 24, 25, 12} y 1 - {44, 44, 45, 21} 
and 

x 2 *» {26, 24, 26, 10} y ^ * {44, 44, 45, 21}, 

the frequency count associated with x^ is 1; the 
frequency count associated with x^ is 1. 

The areas to histogram will be defined by test/training 
fields. The manner in which the fields are collected or 
grouped for histogramming is user controlled by input 
parameters. The data vectors may be collectively histogrammed 
on class, subclass, or per field level. The maximum number 
of fields input on any level is 200, and the maximum number 
of unique data vectors accumulated on any .level is 12,000/ 

(1/4 number of channels). (Maximum of four radiance values 
per computer word are packed together; each radiance value 
is stored in nine bits). 

3. 2 . 1 . 1 Linkages 

The processor NDHIST will use the Fortran-V compiler, Univac 
software system routines, and EOD-LARSYS utility routines. 

3. 2. 1.2 Interfaces 

A map tape may optionally be input. If created by ISOCLS, 
the map tape will contain the cluster images; if created by 
CLASSIFY, the classification images. See section D of Appendices 
in the EOD-LARSYS document for the format of the map tape. 

3 . 2 . 1 . 3 Inputs 

An imagery data tape must be input. The format must be either 
Universal or LARSYS II. See section A and B of Appendices 
in EOD-LARSYS document for tape formats. 

The card inputs consists of three types of cards: processor, 

control and field cards. 



? 
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The processor card is as follows with the keyword starting in 
column 1 ; 

Keyword Function 

$NDHIST Loads all the routines needed 

by this processor into the system. 


With the keywords starting in column 1, the parameters/ 
values starting in any column past 10, and m and n being 
integer values, the format of the control cards is as follows: 


Keyword 

CHANNEL 


DATAFILE 


MAPFILE 


Parameter 


Function 


PLOT®^ ,n 2 , . . ,n N , 
COLOR=m 1 ,m 2 , . . .m M 
(Default: For 

PLOT channels, no 
default; for COLOR 
channels, first 4 
PLOT channels, 
when applicable) . 


UNIT=n,FILE=m 
(Default: n=3 

m=l) 


UNIT=n , FILE=m 
(Default: NONE) 


n^ are the channels for 
determining the position 
of the pixel (i£l6) . Mj 
are the channels for the 
color codes (j<4) . 

If mj/{ }, the histogram 
is calculated using the 
COLOR channels; if mj ={ }, 
the histogram is calculated 
using the PLOT channels. 

n is the logical unit 
number assigned to the 
imagery data tape, m is 
the file number of the data 
to process, 
n is the logical unit 

number assigned to the 
cluster/classification 
map tape, m is the file 
number of the data to 
process. (The order of 
the fields to be histo- 
grammed must correspond 
to the order of the clustered/ 
classified fields on the 
input map tape) 


Parameter 


Function 


Keyword 

HISFILE 


OPTION 


OPTION 


OPTION 


OPTION 


HED1 


HED2 


DATE 


UNIT*n n is the logical unit 
(Default: n**4) number assigned to the 

n-dimensional histogram 
file output by this pro- 
cessor* 


CLASS 

(Default: Field 

bases) 


Fields will be histogrammed 
on class bases. 


SUBCLS Fields will be histogrammed 

(Default: Field on subclass bases, 

bases) 


FIELD 

(Default: Field 

bases) 


Fields will be histogrammed 
on per field bases. 


MEANS The means of the field(s) 

will be computed and out- 
put on the n-dimensional 
histogram file. 


Any 60 characters First line of the heading 
(Default: Lyndon on line printer output. 

B. Johnson Space 
Center) 


Any 60 characters Second line of the heading 
(Default: Houston, on line printer output. 

Texas) 


Any 12 characters Date in the heading on 
(Default: Present line printer output, 

date) 


Keyword 

Parameter 

Function 

COMMENT 

Any 60 characters 

Comment printed with head- 


(Default: NONE) 

ing on line printer output. 

*END* 


Indicates the end of the 



control card inputs. 

$END* 


Indicates the end of all 



the card inputs. 


The field card(s) define the area(s) to be histogrammed and 
the OPTION control card determines the level of histogramming. 
The fields may be ordered in one of >ur ways: 

a. As input to STAT 

b. As input to ISOCLS 

c. As input to CLASSIFY 

d. As test/training fields 

Example: 

CLASS NAME WHEAT 

SUBCLASS WHEAT1 

field card 1 
field card 2 
SUBCLASS WHEAT 2 

field card 3 
CLASSNAME NONWHT 

SUBCLASS NONWH1 

field card 4 
SUBCLASS N0NWH2 

field card 5 
SUBCLASS NONWH3 

field card 6 
field card 7 



If the histogram is accumulated on class bases, fields 1, 2 
and 3 are histogrammed collectively and output as data file 1; 
fields 4, 5, 6 and 7 are histogrammed collectively and output 
as data file 2. 

If the histogram is accumulated on subclass bases, fields 1 
and 2 are histogrammed collectively and output as data file 1; 
field 3 is histogrammed and output as data file 2; field 4 
is histogrammed and output as data file 3; field 5 is histo- 
grammed and output as data file 4; fields 6 and 7 are histo- 
grammed collectively and output as data file 5. 

If the histogram is performed on per field bases, each field 
is histogrammed separately and output to a file. Making a 
total of 7 data files created. 

On an accumulative histogram, a maximum of 200 fields may 
be input. 

See section 3.1.3 in EOD-LARSYS document for format of field 
card. 

3 . 2.1.4 Outputs 

A multi-file tape is output as an interface to the scatter 
plot processor, SCTRPL. The file(s) are created with an 
unformatted Fortran write. The number of records per file 
depends on the program options selected by the user. No file 
skipping capability is available; the first file created is 
always file 1. The format of the tape is as follows: 


file 1 


HEADER RECORD 
E-O-F 


data file 


♦ 

data file 
The contents 


TOTMNS 

SIZE 

NOFET2 

(FETVC2 (I) , 1= 
NCLRCH 

(CLRVEC(I) ,1= 


NOFLD2 
NOSUB 2 
TOTUT2 
NOVEC 


CLSVC2 

(SUBVC2 (I) , I» 
( (FIELDS(I , J) 
( (VERTEX (I , J) 


1 


/record 1 

I RECORD 2 
) RECORD 3 
/ RECORD 4 
\ RECORD 5 
} RECORD 6 
RECORD 7 
V E-O-F 


(optional) 


(optional) 

t 


N E-O-F 


of each record is as follows: 


Header Record 

- Total number of means computed 

- NOFHT2/4 

- Number of channels to histogram 

1,N0FET2) - Actual channels to histogram 

- Number of color code channels 
1, NCLRCH) - Actual color code channels 


Record 1 

- Number of fields histogrammed 

- Number of subclasses histogrammed 

- Number of vertices 

- Number of unique vectors histogrammed 


Record 2 


1 ,N0SUB2 ) 

,1-1,4) , J=1 ,N0FLD2) 
,1=1,2) ,J=1,T0TVT2) 


Class name 
Subclass names 
Field information 
Field vertices 


Record 3 (optional) 


(MEANS (I) , 1=1 ,TOTMNS) - Mean stats for input fields 

Record 4 

((PLOT(I,J) ,1=1, SIZE) ,J=l,NOVEC) - Data vectors 

Record 5 

( I D ( I ) ,1*1 ,NOVEC) - Class/subclass/field the data vectors 

belong to 

Record 6 

(COUNTR(I) , 1=1 ,NOVEC) - Number of occurrences of the data 

vectors 

Record 7 (optional) 

(COLOR(I) , 1=1 ,NOVEC) - Color codes extracted from MSS data 

tape 

As line printer output, the following information will be 
printed: 

a. Summary of program options selected 

b. For each data file, identification information such as 
class name, subclass name, field name and field vertices 
for each input field; the number of unique data vectors 
found; and the mean stats for each input field. 

3. 2.1. 5 Storage Requirements 

By the complexity cf the LARSYS program structure, several 
routines reside in core at all times. With the addition of 
this processor and the resident routines, 53K words of storage 
is required to execute this processor. 


3 . 2 , 1 . 6 Description 

With nine bits constituting a byte, up to four radiance values 
are stored into one Univac computer word. A character string 
comparison is used in histogramming the data vectors such 
that 

x i “ ' i 

if, and only if 

N N 

^ word^ = ^ ^ wordj 

i=l j =1 

where word contains the packed data, i^j , and N<4 


The field mean statistics are computed accumulatively for 
channel i by the equation 

U i =[(N-lJ*,. i _ 1 +x k>l 3/N 

where 

N = number of pixels in the field. 
x k i = ra< ^ ance va l ue for pixel k, channel i 
= previously computed mean for channel i 

u . = new mean for channel i 

i 












3.2.2 SOFTWARE COMPONENT NO. 2 (SCTRPL) 


The SCTRPL processor reads the n-dimensional histogram file 
written by NDHIST, determines the line bin level and sample 
bin level for each unique data vector, and outputs the 
spectral plot in the Universal format. For each file pro- 
cessed on the n-dimensional histogram file, a scatter plot 
is created and output to tape. 

The n-dimensional histogram file is read into core, then 
stored on high speed drum. Except when transforming the data 
vectors or sorting the data vectors in descending order, the 
information from the histogram file is directly accessed from 
drum as needed. 

The data vectors are ordered according to the second coordinate 
The second coordinate is the line number for the pixel; the 
first coordinate, the sample number. 

The scatter plot is created line by line. All the pixels 
belonging to a particular line are collected, and the colors 
associated with these pixels and number of occurrences of the 
colors for these pixels are output to tape as channels 1 through 
N respectively in the sample location determined by the pixel's 
first coordinate. 

A secondary function is to optionally output to the line 
printer a pixel frequency scatter plot. The frequencey of 
occurrence or log of frequency of occurrence will be represented 
by a symbol. The location of the symbol on the plot will be 
determined by the radiance values of the pixel. If the data 
has been transformed, then the data must be rescale to exercise 
this option. 


3. 2. 2.1 Linkages 
Same as section 3. 2. 1.1. 


3. 2. 2. 2 Interfaces 


The n- dimensional histogram file created by NDHIST must be 
input. See section 3. 2, 1.4 for format of the tape. 

The statistics file created by STAT/ISOCLS may optionally 
be input. See section 4.1 in EOD-LARSYS document for de- 
scription of file. 

3. 2 . 2 . 3 Inputs 

The card inputs consists of two types of cards: processor 

and control cards. 

The processor card is as follows with the keyword starting in 
column 1: 

Keyword Function 

$SCTRPL Loads all the routines needed \>y 

this processor into the system. 

With the keywords starting in column 1, the parameters/ 
values starting in any column past 10, and m and n being 
integer values, the format of the control cards is as follows: 


Keyword 

Parameter 

Function 

CHANNEL 

**1 ,n 2 ' * ,n N 
(Default: First 

4 channels from 
n-dimensional 
histogram file) 

Statistics for these 
channels will be extracted 
from the stat file. n N 
must be a subset of channels 
on the stat file. 

STATFILE 

UNIT=n , FILE=m 
(Default: NONE) 

n is the logical unit 
number assigned to the 
stat file. 

m is the number of the file 
to process, 



Parameter 


Function 


Keywor d 

MISFILE 


PtXPLT 


PIXPLT 


PIXPLT 


COLOR 


UNIT*n 

(Default: n»4) 


LOG 


FREQ 


RESCALE 
(Default: No 

rescaling. 
XSIZ=101,YSIZ=101 
The range for 
X-axis is XLO+ 
XSIZ-1; the range 
for y-axis, YLO+ 
YSIZ-1) . 

(m x ) , (m 2 ) . . . (n» N ) 
or 

L* (m x ) , K* (m L+ x ) 

L and K are inte- 
ger repetition 
factors. 


n is the logical unit 
number assigned to the 
n-dimensional histogram 
file. 


Line printer pixel scatter 
plot of the log of fre- 
quency of occurrences will 
be printed. 


Line printer pixel scatter 
plot of the frequency of 
occurrence will be printed. 

The frequency of occurrences 
of the pixel for the line 
printer scatter plot will 
be scaled to the range XLO, 
XHI , YLO and Yfll . XSIZ 
will determine the number 
of bins on the x-axis; 

YSIZ, the number of bins 
on the y-axis. (See SIZE 
control cards) . 


m l = n l’ n 2 M 


.n„ - colors 
M 

assigned to channels 1, 

2 . . .M for cluster 1 . 


. . . n,. - colors 
M 


m 2 = n l ,n 2 
assigned to channels 1, 

2 . . ,M for cluster 2 . 


m^ = n^.n^-.n^ - colors 
assigned to channels 1, 

2 . . .M for cluster N . 


Keyword 

Parameter 

Function 

SIZE 

XSIZ-n 

(Default: 

XSIZ-101) 

The number of samples 
per line to output on 
the scatter plot tape. 
n<200 

SIZE 

YSIZ=n 

(Default: 

YSIZ*101) 

The number of lines to 
output on the scatter 
plot tape. n;<200 

SIZE 

XHI=n 

(Default: 

XHI^lOO) 

The upper limit of the 
radiance value for the 
sample axis (x-axis) 
n<255 

SIZE 

XLO=n 
(Default : 
XLO=0 

The lower limit of the 
radiance value for the 
sample axis (x-axis) 
0<n<XHI. 

SIZE 

YHI=n 

(Default: 

YHI=100) 

The upper limit of the 
radiance value for the 
line axis (y-axis) 
n<25 5 


YLO=n 
(Default : 
YLO=0) 

The lower limit of the 
radiance value for the 
line axis (y-axis) . 
0_<n< 2 5 5 

PLOTAP 

UNIT=n 

(Default: n=7) 

n is the logical unit 
number assigned to the 
spectral plot tape. 


1 



Keyword 

Parameter 

Function 

B-MATRIX 

CARDS 

(Default: NONE) 

CARDS - B-Matrix is being 
input by cards. 

B-MATRIX 

FILE 

Flt£ - B-Matrix is being 
input by file. 

BVEC 

t l» t 2* * ' *N 

Elements of the additive 

vector to be used in the 

transformation, t. are 

1 

floating pt. numbers. 

BCKGND 

n 

(Default: n,25S) 

n = 0 - background will 
be black. 

n * 255 - background will 
be white. 

SCALE 

FILE 

The scale factors will be 
computed from the n-dimen- 
sional histogram file. 

SCALE 

XMAX- t 

(Default: XMAX 

will be computed 
from the n-dimen- 
sional histogram 
filp) • 

The upper range for the 
transformation of the sampl 
values (x-axis) . t is a 
floating pt. number. 

See note on next page. 

SCALE 

XMIN-t 

(Default: same 

as above) 

The lower range for the 
transformation of the 
sample values (x-axis) . 


t is a floating pt. 
number . 



Keyword 

Parameter 


Function 

SCALE 

XMIN-t 
(Default: 
as above) 

same 

The lower range for the 
transformation of the 
sample values (x-axis). 
t is a floating pt. number. 

SCALE 

YMAX«t 
(Default : 
as above) 

same 

The upper range for the 
transformation of the 
line values (y-axis). 
t is a floating pt. number. 

SCALE 

YMIN*t 
(Default: 
as above) 

same 

The lower range for the 
transformation of the 
line values (y-axis) . 
t is a floating pt. number. 

SCALE 

RESCALE 
(Default: No 

rescaling of 
the transformed 
data) 

The transformed data will 
be rescaled to the range 
of XLO, XHI , YLO and YHI. 
(See SIZE control card) 

DATE 

COMMENT 

HED1 

HED2 



See Section 3. 2.1. 3. 


Note: If one of the parameters XMIN, XMAX, YMIN or YMAX is 

input, then all four parameters must be input. 


3. 2. 2. 4 Outputs 

A multi-file Universal formatted tape containing the spectral 
plots and color key, when applicable, will be output. The 
spectral plot images will be the first N lines, followed by 
M number of blank lines, followed by the color key. See section 
B of Appendices in EOD-LARSYS document for format of tape. 

As line printer output the following information will be 
printed: 

a. Summary of the options selected. 

b. For each file output to tape, identification information 
such as class name, subclass name, and field information 
relating to the spectral plot. 

c. Optionally, a line priniej pixel frequency scatter plot. 

3 . 2 . 2 . 5 Storage Requirements 
Same as section 3. 2. 1.5. 

3 . 2 . 2 . 6 Description 

If the data vector is to be transformed or reduced to two 
linear combinations of channels, the following equations and 
conditions will be applied: 

y = Bx + c 

where y is a 2x1 vector 

B is a 2xn vector, n<.16 
x is a nxl vector, n<16 
c is a 2x1 vector 

If the transformed data is to be rescaled, the following 
equation will be applied: 


y. -[(HI 1 -L0 i )/RAN6B i ]*|MIN i -Z i | 


where HI^ * an inpat parameter for the upper limit of the 
bin level for channel i, 

LO^ * an input parameter for the lower limit of the 
bin level for channel i, 

MIN^- minimum value for channels i. 

- transformed data point for channel i. 

RANGE^ - range (max^-min,^) for channel i. 

y. - rescaled data value for channel i. 

' i 



3 .2.2,7 Flowchart 




OUTPUT 
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